Toward performance-portable PETSc for GPU-based exascale systems

نویسندگان

چکیده

The Portable Extensible Toolkit for Scientific computation (PETSc) library delivers scalable solvers nonlinear time-dependent differential and algebraic equations numerical optimization. PETSc design performance portability addresses fundamental GPU accelerator challenges stresses flexibility extensibility by separating the programming model used application from that library, it enables developers to use their preferred model, such as Kokkos, RAJA, SYCL, HIP, CUDA, or OpenCL, on upcoming exascale systems. A blueprint using GPUs PETSc-based codes is provided, case studies emphasize high achieved current GPU-based

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A performance-portable generic component for 2D convolution computations on GPU-based systems

In this paper, we describe our work on providing a generic yet optimized GPU (CUDA/OpenCL) implementation for the 2D MapOverlap skeleton. We explain our implementation with the help of a 2D convolution application, implemented using the newly developed skeleton. The memory (constant and shared memory) and adaptive tiling optimizations are applied and their performance implications are evaluated...

متن کامل

Towards Performance Portable GPU Programming with RAJA

High-performance machines of today are already heterogeneous in nature with traditional multicores and accelerators such as GPUs and Xeon PHIs. It is a challenge to program these machines in a performance portable way with code that is maintainable across large production applications. RAJA is a recently introduced C/C++ programming approach targeting fine-grained parallelism in loops that is i...

متن کامل

Toward Exascale Resilience

Over the past few years resilience has became a major issue for HPC systems, in particular in the perspective of large Petascale systems and future Exascale ones. These systems will typically gather from half a million to several millions of CPU cores running up to a billion of threads. From the current knowledge and observations of existing large systems, it is anticipated that Exascale system...

متن کامل

Contemporary High Performance Computing - From Petascale toward Exascale

Exascale focuses on the ecosystems surrounding the world s leading centers for high performance computing (HPC). It covers many of the important factors...

متن کامل

Technologies for exascale systems

exascale systems P. W. Coteus J. U. Knickerbocker C. H. Lam Y. A. Vlasov To satisfy the economic drive for ever more powerful computers to handle scientific and business applications, new technologies are needed to overcome the limitations of current approaches. New memory technologies will address the need for greater amounts of data in close proximity to the processors. Three-dimensional sili...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Parallel Computing

سال: 2021

ISSN: ['1872-7336', '0167-8191']

DOI: https://doi.org/10.1016/j.parco.2021.102831